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Abstract—Service composition is an important research prob- 
lem in service computing systems, which combines simple and 
individual services into composite services to fulfill users’ complex 
requirements. Service composition usually consists of four stages, 
i.e., service discovery, candidate selection, service negotiation 
and task execution. In self-organising systems, there is the fifth 
stage of service composition: self-evolution. Most of existing 
works study only some of the five stages. However, these five 
stages should be systematically studied so as to develop an 
integrated and efficient service composition approach. Against 
this background, this paper proposes an agent-based integrated 
self-evolving service composition approach. This approach sys- 
tematically takes the five stages of service composition into 
consideration. It is also decentralised and self-evolvable. Exper- 
imental results demonstrate that the proposed approach can 
achieve almost the same success rates while uses much less 
communication overhead and time consumption in comparison 
with three existing representative approaches. 


Keywords - Service Composition, Self-organisation, Multi- 
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I. INTRODUCTION 

With the development of computing, service-oriented com- 
puting becomes a new paradigm for engineering modern 
complex systems which are composed of various services for 
users’ consumption [1]. Due to the increase of the complexity 
of users’ requirements, individual services sometimes cannot 
fulfill users’ requirements. Service composition becomes a 
necessary and effective technique to satisfy users’ complex 
requirements. Service composition enables simple and indi- 
vidual services to be dynamically combined into composite 
services by coordinating service providers and configuring 
existing services so as to meet users’ complex requirements 
[2]. Currently, service composition has been widely studied in 
many systems, e.g., cloud systems [3], multi-agent systems 
[4], mobile ad hoc networks [5] and sensor networks [6]. 
Specifically, service composition is an aggregation of atomic 
or composite services which collaborate to implement a set of 
operations in order to carry out a complex task or a business 
process [7]. 

In the last decade, a number of service composition ap- 
proaches have been proposed. Most of these approaches are 
centralised [8], [9], while only a few of them are decentralised 
[10], [11]. In these centralised approaches, there is a central 
manager or controller who has the full knowledge of the en- 
vironments and is responsible to organise service composition 
for consumers in the environments. The centralised approaches 
are very efficient in small scale environments. However, they 
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incur computation bottleneck and suffer the single point of 
failure. They are also inapplicable in some environments where 
there is not a central manager, e.g., sensor networks [6]. The 
decentralised approaches overcome the computation bottle- 
neck and the single point of failure drawback of the centralised 
approaches by enabling a service consumer to autonomously 
organise service composition. However, some decentralised 
approaches [12], [13] assume that each service consumer has 
the full knowledge of the environment. This assumption is 
infeasible in some large-scale environments, e.g., multi-cloud 
environments [14]. Other decentralised approaches [10] waive 
this assumption by modeling a networked environment, where 
each participant has a set of neighbours or acquaintances 
and each participant has knowledge only about its neighbours 
or acquaintances. These decentralised approaches, however, 
overlook the evolution of the operating environments, where 
participants may change the acquaintance relationship between 
each other over time and service providers may buy extra ser- 
vices from other participants or sell infrequently used services 
to other participants. Such evolution exists in a number of 
real world systems, e.g., multi-agent systems [15], [16], social 
networks [17] and peer-to-peer systems [18]. Therefore, to 
adapt to such evolving environments rapidly, it is important 
to develop a service composition approach which can self- 
evolve over time. Moreover, service composition consists of 
five stages: service discovery, candidate selection, service 
negotiation, task execution and self-evolution, whereas most 
of the existing approaches study only some of the five stages 
[19], [5], [20], [21]. To develop an integrated and efficient 
service composition approach, all the five stages should be 
systematically studied. 

Based on this background, in this paper, an agent-based 
integrated self-evolving service composition approach is pro- 
posed. The proposed approach tackles two research challenges 
of service composition. First, self-evolution is an important 
property of self-organising systems [22], [16]. However, self- 
evolution in service composition has not been addressed prop- 
erly in existing approaches [19]. Second, as described above, 
service composition consists of five stages which should be 
systematically considered to achieve an integrated service 
composition approach. Existing approaches, however, consider 
only some of the five stages [5]. 

In summary, the advantages of the proposed approach are 
as follows. 

1) The proposed approach is decentralised which can avoid 
the computation bottleneck and the single point of failure and 
does not need global information. 

2) The existing approaches, which consider self-evolution, 
study service migration only [19], whereas the proposed 
approach studies both service migration and relation mod- 
ification. Service migration is for service exchange among 
participants in order to increase utilisation of services. Relation 


modification enables participants to remove existing and/or 
add new acquaintances so as to build neighbourhood with other 
useful participants. Both of service migration and relation 
modification are important features of self-organising systems 
[22], [16]. 

3) Unlike the existing approaches which consider only some 
of the five stages of service composition [5], the proposed 
approach is the first integrated one which combines all the 
five stages. 

4) The proposed approach is developed in a general environ- 
ment instead of specific systems, e.g., mobile ad hoc networks 
[20]. This means that the proposed approach could be applied 
in various systems. 

The rest of the paper is organised as follows. Section II 
presents related studies about service composition. Section 
HI provides a motivating example of our research. Section 
IV describes the detail of our integrated self-evolving service 
composition approach. Experiments and the corresponding 
analysis are given in Section V. Section VI concludes the paper 
and points out the future research directions. 


II. RELATED WORK 

Various approaches have been proposed to handle the ser- 
vice composition problem [8], [9]. Most of these approaches 
are centralised [23], [1], [24], [25], [14]. 

Ardagna and Pernici [1] formalised the service composition 
problem as a mixed integer linear programming problem. They 
used loops peeling in the optimisation and took constraints 
posed by stateful services into consideration. In their approach, 
there is a centralised Concretisator module which selects the 
best concrete services to be invoked from a service registry 
for each task of the composed service according to user’s 
requirements and the optimisation criteria. 

Kurdi et al. [14] developed a centralised combinatorial 
optimisation algorithm for service composition in multi-cloud 
environments. Their algorithm includes a cloud combiner and 
a service composer. The cloud combiner is used to select 
the appropriate cloud combination from the multi-cloud en- 
vironment. The service composer takes the cloud combination 
as input and generates a final service composition sequence 
which meets user’s request. 

The centralised approaches are very efficient in small scale 
environments. However, as all the computation and commu- 
nication are handled by a central manager, the centralised 
approaches may incur computation and communication bottle- 
necks and suffer the single point of failure. In order to over- 
come the drawbacks of centralised approaches, decentralised 
approaches have also been proposed. 

Moustafa and Zhang [12] proposed a reinforcement learn- 
ing approach for multi-objective service composition and 
adaptation in dynamic uncertain environments. They used 
multi-objective Markov decision process to model the service 
composition problem. Then, two algorithms were devised 
to handle single policy and multiple policy multi-objective 
service composition based on user preferences. The solution 
of each of the two algorithms is a procedure that indicates 
how an agent selects a service in each state. By using the two 
algorithms, an agent can find a set of optimal workflows which 
fulfill the trade-offs among multiple QoS objectives. 

Wang et al. [26] proposed a reinforcement learning approach 
for large scale and adaptive service composition. They mod- 
eled the service composition problem as a multi-dimensional 


transition graph and developed a multi-agent learning algo- 
rithm to handle the problem. The solution of their algorithm is 
a procedure that indicates how a team of agents selects services 
in each state. Unlike Moustafa and Zhang’s single agent 
learning algorithm, Wang et al.’s algorithm is a multi-agent 
one which can perform better than a single agent learning 
algorithm. 

Moustafa et al. [11] proposed a stigmergy-based approach to 
model service interactions and handle service composition. In 
their approach, interactions among service agents are indirect 
by leaving and sensing artificial pheromone. Such pheromone 
encodes specific information which is used to achieve service 
composition. Also, they developed a trust model for service 
selection based on the balance between the trust ranks of the 
concrete services and the trust ranks of the whole workflow. 

These decentralised approaches distribute the decision pro- 
cess about service composition to each service consumer. 
Thus, the computation bottleneck and the single point of fail- 
ure can be avoided. These decentralised approaches, however, 
have to use global information for service composition. For 
example, a service consumer is allowed to select services in 
the whole environment, which implies that a service consumer 
knows all the service providers in the environment. The use 
of global information is infeasible in large environments. In 
order to avoid using global information, some decentralised 
approaches were developed in networked environments, where 
each participant has a set of acquaintances and each participant 
has knowledge only about these acquaintances. 

Sim [3] proposed an agent-based paradigm for cloud re- 
source management which includes cloud search, cloud com- 
merce and cloud service composition. In his approach, each 
participant has a service capability table which records a 
list of neighbouring participants and their services. Then, a 
consumer selects services for composition by consulting its 
service capability table to determine to which participants it 
should send its request message. Through this way, the number 
of messages exchanged among participants in the system can 
be significantly reduced compared to those approaches which 
broadcast messages to all the participants in the system. 

Gutierrez-Garcia and Sim [10] devised a semi-recursive 
contract net protocol enhanced with service capability tables. 
The protocol is based on recursive calls of the contract net 
protocol [27] which is a distributed problem solving tech- 
nique used for achieving service agreements among service 
consumers and service providers. By using the protocol, a 
service consumer can efficiently find feasible service providers 
which are capable of carrying out a given task. 

These decentralised approaches were designed based only 
on local information, so they can be used in large scale 
environments. However, these approaches do not consider 
the multiple stages of service composition, including the 
evolution of environments, where participants may change 
the acquaintance relationship between each other over time 
and participants may buy extra services from others or sell 
infrequently used services to others. Some other decentralised 
approaches take multiple stages of service composition into 
consideration but they still have some limitations or differences 
compared to our approach. 

Prinz et al. [19] studied dynamic service composition in 
P2P systems by introducing logical peer groups. They then 
proposed an approach which enables runtime composite ser- 
vice reconfiguration including service exchange among peers. 


The formation of a logical service group is carried out by 
an initial coordinator which collects information about all 
peers required sub-service. During execution, new peers may 
arrive and peers’ execution properties may vary. Then, sub- 
services may be exchanged among peers to guarantee the 
proper execution of the system. Their approach, however, is 
based on global information, while our approach is based 
only on local information. Moreover, self-evolution in their 
approach considers service exchange only, while self-evolution 
in our approach considers both service exchange and relation 
modification. 

Cardellini et al. [21] proposed a decentralised service 
composition approach by using a gossip protocol to sup- 
port information dissemination and decision making. In their 
approach, each peer has three components: network latency 
estimator, workflow manager and gossip manager. Network 
latency estimator estimates the network delay between pairs of 
peers in order to meet QoS requirements. Workflow manager is 
responsible for initialising the composition of a new workflow 
or for repairing an existing workflow. Gossip manager is 
responsible for decentralised information dissemination and 
decision making so as to achieve dynamic service composition 
and adaptation of workflows. Their approach is fully decen- 
tralised which is based only on local information. However, 
their study focuses on service discovery and selection while 
overlooks other stages. In comparison, our approach considers 
all the five stages. 

Groba and Clarke [20] presented a service composition 
protocol which opportunistically allocates and invokes service 
providers. They modeled a service composite as a directed 
acyclic graph and a service provision as a time-varying graph. 
Then, service composition is to find a mapping between 
the directed acyclic graph and the sequence of time-varying 
graph such that the edges between two services are enabled 
for the time they interact. Their protocol is independent of 
network topology and can reduce failure probability. Their 
study is different from ours in the following aspects. First, 
their protocol is developed in dynamic ad hoc environments, 
while our approach is designed in a general environment, 
which means that our approach could be applied in various 
environments. Second, for service composition, their protocol 
mainly focuses on three stages: service discovery, service 
selection and service execution (known as service allocation 
and invocation in the paper)', while our approach studies all 
the five stages. Third, in their protocol, the service composition 
problem is modeled as a graph mapping problem, while in our 
approach, the service composition problem is modeled as a 
multi-agent resource allocation problem. Since the models and 
environments are different, the solutions are entirely different. 

Chen and Clarke [5] proposed a decentralised service 
composition model, where a system dynamically adapts its 
business process by composing its fragments on-demand to 
meet the constraints of service providers and consumers. 
Their model matches L (L < 1) service queries to M 
(M < 1) services by adapting abstract workflows and their 
concrete implementation details. Their work is carried out in 
ad hoc environments and focuses on two stages of service 


'Groba and Clarke [20]’s protocol also considers other two stages: synchro- 
nisation and communication. The two stages have to be considered in ad hoc 
environments but are not necessary in general environments. If our approach 
is applied to ad hoc or other communication environments, these two stages 
will be taken into account. 


composition: service discovery and service selection, while 
our work is conducted in general environments and focuses 
on all the five stages of service composition. 

Compared with existing approaches, our approach is de- 
centralised, uses only local information and takes multiple 
stages, including self-evolution, of service composition into 
consideration. 


II. A MOTIVATING EXAMPLE 





Fig. 1. A motivating example 


This section provides an application example from target 
tracking in wireless sensor networks (WSNSs). It should be 
noted that the target tracking in WSNs is an application 
example only. The proposed service composition approach will 
be developed in a general environment rather than any specific 
networks or systems. 

In Fig. 1, each circle represents a sensor node. Suppose 
that sensor 1 has detected a target and needs to identify 
and track the target. To identify and track a target, sensor 1 
needs two services: identification service and tracking service. 
Sensor 1, however, does not have such two services and 
needs to ask other sensors in the network for help. This is 
a wireless sensor network, so sensor 1 has knowledge only 
about its neighbouring sensors, i.e., sensor 2 and sensor 3. 
Sensor 1 will inquire sensor 2 and sensor 3 that if they 
can provide any of the services. In the case that sensor 2 
and sensor 3 cannot provide the required services, they will 
inquire their neighbours. This process will continue until the 
pre-set TTL (Time To Live) value for the discovery of each 
service is reached (stage 1). After the search process, several 
service providers may be found for each service request. Then, 
sensor 1 will select providers for the service requests based 
on some criteria (stages 2 and 3). After execution of these 
services (stage 4), sensor 1 will evolve (stage 5). Sensor 1 
may remove less efficient neighbours or may add efficient 
sensors as new neighbours. In this example, the removed or 
added neighbours are logic neighbours rather than physical 
neighbours, because physical neighbours are formed based on 
sensors sensing ranges and thus cannot be changed arbitrarily. 
In WSNs, sensors can change physical neighbours by adjusting 
their sensing ranges. The detailed discussion of discovery and 
change of physical neighbours in WSNs is out of the scope 
of this paper. Interested readers can refer to [28]. 

Specifically, the service workflow of the example tracking 
network is given in Fig. 2. As stated in [29], [30], there are 
roughly seven types of services used in a tracking network. 5} 
is the target discovery service which is used for discovering 
that a target appears in the network. Ss is the target detection 
service which is used to locate the position of the target. S3, 
S4 and S; constitute the target identification service, where S3 
is the target feature extraction service, S4 is the target feature 


classification service, and S; is the target feature recognition 
service. Sg is the target tracking service which is used to chase 
the target. Sg may be provided by multiple service providers, 
i.e., sensors, as the target may travel in the network and a 
sensor is not powerful enough to cover the area that the target 
travels through. S7 is the processing and reporting service 
which is used to process the collected information and report 
the outcome to the user. 


Fig. 2. The service workflow 


In this application example, the WSN is a decentralised net- 
work. Thus, those centralised service composition approaches 
are most likely inadequate here. Also, in WSNs, the com- 
munication capability of sensors is very limited [31], [32]. 
Thus, those decentralised service composition approaches, 
which have to use global information, are also most likely 
inadequate, as the acquisition of global information involves 
large overhead of communication. Finally, as WSNs are self- 
organising systems [33], those decentralised service com- 
position approaches, which do not consider the evolution 
of environments, are again most likely inadequate here ei- 
ther. Therefore, a decentralised service composition approach, 
which considers multiple stages of service composition and 
uses little communication overhead, is necessary and the aim 
of this paper is to develop such an approach. 


IV. THE INTEGRATED SELF-EVOLVING APPROACH DESIGN 

In this section, we first formulate the problem. Then, an 
overview of the proposed approach is given. Finally, the details 
of the proposed approach are presented. 


A. Problem Formulation 


The proposed approach is developed in a distributed 
networked environment which is defined as an acquaintance 
network? in Definition 1. 


Definition 1: (Acquaintance Network). The acquaintance 
network is defined as an undirected graph G = (V,E). 
V = {v1,...,Un} is the set of participants in the network. 
E CV x V is the set of edges which link participants. Two 
participants, v; and vj, are acquaintances of each other if 
(vi, vj) € E. Each participant, v;, has an acquaintance set 
N; = {v;| (vi, vj} € E}. In addition, N} = N; U {v;} is the 
acquaintance set which includes v; itself. Each participant 
can directly interact only with those participants in its 
acquaintance set. 


Each participant, v;, in the acquaintance network has a 
set of services recorded as S;. Each participant has three 


?As this paper aims at developing a service composition approach, the 
detailed discussion of establishment and maintenance of networks is out of 
the scope of this paper. Interested readers can refer [34], [35], [36] for the 
establishment and maintenance of networks. 


possible roles: consumer, broker and provider which are 
defined in Definition 2. 


Definition 2: (Consumer, Broker and Provider). When 
a participant initialises a service request, this participant is 
called a service consumer. When a participant provides a 
service or services to a consumer, this participant is called a 
service provider. When a participant receives a commitment 
from a consumer to find proper providers, this participant is 
called a broker. 


According to Definition 2, the role of a participant is 
based on a specific service request. As each participant has 
a set of services, a participant can be a consumer, a broker 
and a provider simultaneously. For example, a participant, 
vi, needs a service, s1, so for service s1, V; is a consumer. 
Meanwhile, v; provides service s2 to participant vj, so for 
service s2, v; is a provider. Also, v; has a commitment from 
participant v; for service s3 to find a proper provider, so for 
service s3, v; is a broker. As these processes can happen 
concurrently, v; can be a consumer, a broker and a provider 
simultaneously. 

The service composition problem in this paper is described 
in Definition 3. 


Definition 3: (Service Composition). In an acquaintance 
network, when a consumer has a complex task, T, which 
needs a set of services to fulfill, T < {s1,..., Sm}, then how 
to find and select proper providers for these services is called 
service composition in this paper?. 


In Definition 3, the number of required services of a 
complex task depends on the type of the task. For example, 
in Section HI, the target chasing task needs five services: 
the target discovery service, the target detection service, the 
target identification service, the target tracking service, and 
the processing and reporting service. Furthermore, the target 
identification service can be considered as a sub-task which 
needs three services: the target feature extraction service, 
the target feature classification service and the target feature 
recognition service. As our approach is developed in a general 
environment, we do not set a specific number of services for 
a complex task. Instead, we use a variable, m, to represent 
the number of required services. 


B. Overview 


To make participants autonomous, each participant is 
equipped with an intelligent agent which behaves on behalf of 
the participant. Then, various intelligent agent technologies, 
e.g., multi-agent learning, multi-agent trust management and 
multi-agent negotiation, can be used. The use of agent tech- 
nologies in service composition can overcome the limitations 
in traditional service composition approaches [11]. Here, an 
intelligent agent is an entity which can make rational deci- 
sions autonomously in a dynamic environment. An intelligent 
agent blends pro-activeness and re-activeness, shows rational 


3 Accurately, service composition should include the building up process of 
a composite service. The building up process of a composite service, however, 
depends on specific applications. As the proposed service composition ap- 
proach is developed in a general environment, we do not consider the building 
up process of a composite service in this paper. 


commitments to decision making and exhibits flexibility when 
facing an uncertain and changing environment [37]. 

When a consumer has a complex task to fulfill, the con- 
sumer agent starts the service composition using the proposed 
approach. The service composition approach consists of the 
following five stages. 

1) Service discovery: For each of the required service, the 
consumer agent searches the acquaintance network directly or 
via its acquaintances. 

2) Candidate selection: After search, the consumer agent 
uses a trust model to select the candidate service providers. 

3) Service negotiation: After selection, the consumer agent 
negotiates with these candidate provider agents in order to 
achieve service agreements for the required service. 

4) Task execution: Then, the consumer agent chooses a set 
of service agreements for the required service. In the set of 
service agreements, one service is used to carry out the task 
and other services are used as backup in case that the service 
in use unexpectedly fails. 

5) Network evolution: Finally, the consumer agent evaluates 
its current status, based on historical data, to make decisions 
that whether it needs to modify relationships with its acquain- 
tances and whether it needs to buy or sell some services. 

The mechanism or algorithm for each stage will be de- 
scribed in detail in the following sub-sections. 


C. Stage 1: service discovery 


The service discovery process is based only on local infor- 
mation, which is described in Algorithm 1 as follows. 


Algorithm 1: Service discovery process 
1 for each participant vi € V do 


2 for each service request, sx, in task T of vi do 

3 CandListi(sx) + ; 

4 if any acquaintance vj, where vj € NY, can provide 
service spk then 

5 CandList;(s,) < CandList;(s,) U {v;}; 

6 while TT L(s,) > 0 do 

7 select m acquaintances as brokers; 


if any acquaintance vj of these brokers can 
provide service są then 
9 | CandList; (sx) + CandListi (sz) U {uj}; 


10 | TTL(sk) — TTL(se) — 1; 











For each service request są of a participant v;, vj creates 
a candidate list CandList;(s;,) and initialises the list as an 
empty set (Lines 2 and 3). First, v; inquires its acquaintances 
whether they have the service sẹ (Line 4). Those participants 
which have the service są will be added into the candidate 
list CandList;(s;) (Line 5). Then, v; selects m acquaintances 
from N; as brokers and commits the selected acquaintances to 
find candidates for service s (Line 7). This selection is based 
on the performance of v;’s acquaintances, which is described 
in Algorithm 2. Each broker inquires its acquaintances about 
service są and replies v; which acquaintances have service sę 
(Line 8). These acquaintances, which have service sz, will be 
added in the candidate list CandList;(s;,) (Line 9). After- 
wards, each broker continues the search process by selecting 
m acquaintances as new brokers until TTL(s;,) reaches 0. 
Here, local information means that each participant knows 
only its neighbours, i.e., acquaintances. If a participant wants 


to have more information, it has to inquire its neighbours. 
In Algorithm 1, the loop is used by participants to obtain 
more information. The loop is controlled by a parameter, TT L, 
which is set by users, to avoid infinite search in the network. 
Therefore, how many participants can be searched in a loop 
entirely depends on the value of TT L. Different services may 
have different values of TT L. For example, important services 
can be set a relatively larger TTL compared to less important 
services so as to have more providers found. 


Algorithm 2: Broker selection 


1 \* Initialisation * \ 
2 for each acquaintance v; € N; do 
3 Let the performance of vj be Q(v;); 





4 Let the probability of selecting v; as a broker be p(v;); 

5 The probability distribution over acquaintances is recorded 
as P = (p(v1), .--, p(vjn;,|)), where |N;| means the number 
of acquaintances of v; and ) 7) <;<)y,) P(vj) = 1 and 
0 < p(v;) < 1; 

6 v; initialises each Q(v;) = 0 and p(v;) = AB 


7 \* Selection and adjustment * \ 

8 v; selects m acquaintances as brokers based on probability 
distribution P; 

9 Each acquaintance of v; is evaluated and assigned a reward rj; 

10 F EA Lojen; rj; 

11 for each acquaintance vj E€ N; do 

12 | Q(vj) — (1 —a)Q(v;) +a- rj; 

plv) = plv) + C(Q(v;) — 7); 
14 P © Normalise(P); 


In Algorithm 2, v; first initialises the performance of each 
acquaintance and the probability of selecting each acquain- 
tance (Lines 2-6). Because initially, v; is not aware of its 
acquaintances’ performance, it simply sets the performance 
of each acquaintance to Q(v;) = 0 and sets the probability of 
selecting each acquaintance equally to p(v;) = INT: Then, v; 
selects m acquaintances as brokers based on probability dis- 
tribution P (Line 8). For example, v; has three acquaintances 
and the probability distribution over the three acquaintances 
is P = (0.2,0.3,0.5). Then, v; has the probability 0.2 to 
select the first acquaintance, the probability 0.3 to select the 
second acquaintance and the probability 0.5 to select the third 
acquaintance, as a broker. 

When these brokers finish their job, v; evaluates the perfor- 
mance of each acquaintance and assigns each acquaintance 
a reward r; (Line 9). For those acquaintances which are 
not selected as brokers, they are assigned a reward 0. For 
the brokers, the reward of a broker is based on how many 
candidates it has found for v;’s task T. The more candidates 
a broker has found, the more reward the broker is assigned. 
For example, suppose a threshold is set as 3. Then, if a broker 
has found 5 candidates, this broker is assigned a reward 2, 
where 2 = 5 — 3; whereas if another broker has found only 
2 candidates, this broker is assigned a reward —1, where 
=1=2—3. 

After the assignment of reward to each acquaintance, v; 
adjusts the performance of each acquaintance and the prob- 
ability of selecting each acquaintance (Lines 10-13). In Line 
12, v; adjusts the performance of each acquaintance, Q(v;), 
based on the acquaintance’s current performance and the 
reward, rj, assigned during the service discovery for task T, 
where 0 < a < 1 is the learning rate which is a constant 


of deadline, the consumer’s reserved price for the service, the 
probability of reaching an agreement and the competition of 
the required resource. 

e accept{o]. When a provider receives a quote o, it can 
accept the quote, which results in a temporary agreement made 
with the consumer. 

e counter_quote[o']. If a provider is not happy with a quote 
o, it can send back a counter-quote o’ for its service. A counter- 
quote o’ is determined by three aspects which include the 
provider’s reserved price for the service, the probability of 
reaching an agreement for the serivce and the competition of 
the service. 

e cancel{o|]. After a temporary agreement is achieved by 
a consumer and a provider, any one of them can cancel 
the temporary agreement without paying penalty. However, 
cancellation penalty applies on a backup agreement and a final 
agreement. 

The negotiation protocol is described in Algorithm 4. 


Algorithm 4: Service negotiation 


1 \* Suppose v; is a consumer and v; is a provider * \ 
2 while t <the deadline of the task do 





3 vi generates a quote o to vj; 

4 if vj accepts o then 

5 AT (vi) & AT (vi) U {o}; 

6 AT (uj) + AT (v;) U {o}; 

7 return; 

8 else 

9 vj generates a counter-quote o’ to vi; 
10 if v; accepts o' then 

u AT (vi) — AT (vi) U {0'}; 
12 AT (vj) 4 AT (vj) U {0'}; 
13 return; 

14 else 

15 continue; 








In Algorithm 4, before the deadline of the task, a consumer 
and a provider can alternately propose quotes and counter- 
quotes until an agreement is reached. In Line 3, consumer v; 
generates a quote o to provider vj, where o = (pr, pe). Each 
of the elements in o can be calculated using the following 
equations. 





t—arrive(sk) o(t) . T = 
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In Equation 2, pr is the price that v; quotes to vj. res_pry, 
is the reserved price of v; for the requested service. arrive(sp) 
is the arrival time of service request s, while start(s;,) is 
the latest start time of s. t means current round. 6(t) is 
the difference of v;’s quoting prices between the last round, 
t — 1, and the current round, t, while A(t) is the difference 
of v;’s quoting prices between the initial round, 0, and the 
current round, t. In Equation 2, it can be seen that the quoting 
price is determined by the v;’s reserved price, res_pry,, the 
left time till the deadline (i.e., the pressure of deadline), 

fear) the number of temporary agreements that 


start(s,)—arrive(sz)’ 
v;i has achieved üe., the probability of reaching an agreement), 





| AT (vi)|, and the concession that the negotiating partner vj 
has made (i.e., the competition of the required resource), OE 

In Equation 3, pe is the basic penalty and exe,, means the 
total time steps, during which v; needs to use v;’s service 
to execute the task. The actual penalty that v; has to pay vj 
depends on how long v; has used v,;’s service. For example, 
suppose that pr = 100 and exe,, = 20, so pe = =?" = 5. 


eLey. 
Now, v;i has used v;’s service for 12 time steps and vi wants 
to cancel the agreement (i.e., stopping using v;’s service), then 
vi has to pay vj penalty 60, where 60 = pe x 12. 

After receiving an offer o from v;, in Line 4, vj evaluates 
whether the offer o is acceptable. This evaluation is based on 
how much revenue vj could get (Equation 4), where cost is 
the cost that v; spends to provide the requested service. 





ru = pr — cost 


(4) 


If the revenue, rv, is larger than a threshold, v; accepts 
quote o and a temporary agreement is achieved (Lines 5 and 6). 
Otherwise, vj generates a counter-quote o' = (pr’, pe’) (Line 
9). The calculation of pr’ is similar to Equation 2 except that 
provider v; does not have a deadline for its service, while the 
calculation of pe’ is the same as Equation 3. Then, Lines 10- 
12, if v; accepts the counter-quote, a temporary agreement is 
achieved. Otherwise, the negotiation goes into the next round. 

At the end of this stage, for each service request sx, 
consumer v; may have achieved a set of temporary agreements, 
where each temporary agreement is signed with an individual 
provider. Consumer v; selects a provider v; to provide service 
Ss, for task execution. The temporary agreement between 
v; and vj now becomes a final agreement. Consumer v; 
also selects several providers for backup and the temporary 
agreements between v; and these backup providers become 
backup agreements. To achieve a backup agreement, v; has 
to pay the corresponding backup provider a non-refundable 
deposit. The amount of deposit can be set as equal to the basic 
penalty. Finally, v; cancels the rest of temporary agreements. 


F. Stage 4: task execution 


As described above, after Stage 3, consumer v; has achieved 
a set of temporary agreements for each service request and v; 
has to select one provider from the temporary agreements for 
each service request. In this stage, an Integer Programming 
(IP) based method [41], [23] is employed by consumer v; to 
select one provider for each service request for task execution. 
For an IP problem, there are three inputs: a set of variables, a 
set of constraints and an objective function. Both the objective 
function and the constraints must be linear. The aim of IP is to 
maximise or minimise the objective function by adjusting the 
values of the variables under the given constraints. The output 
of an IP problem is the maximum (or minimum) value of the 
objective function and the values of variables along with that 
maximum (or minimum) value. 

The problem of selecting a provider for each service request 
can be modeled as an IP problem as follows. We first introduce 
a binary indicator, £jķ, where x;, = 1 if provider vj is 
selected for service request s and xj; = 0 otherwise. We also 
introduce another binary indicator, y,, where y = 1 if service 
request są is critical and yẹ = 0 otherwise. Each service 
request, s, has a start time, start. A provider, vj, needs 
execution duration time, durjķ, to execute service request 
Sp with price prj and reliability relij. The constraints are 
formally expressed as follows. 


ye 


1<jS|AT (v:)| 


jp = 1,Vk € [0, m] (5) 


Constraint 1 (Equation 5) means that for each service 
request sk, consumer selects one and only one provider, vj, 
to provide the service for task execution. 


startı — (durpk + startk) < 0, Vsk > s1 A k,l € [0,m] (6) 
Constraint 2 (Inequality 6) indicates that if service request 


sı is a direct successor of Sx, 1.€., Sk —> Sı, service request s; 
must be executed after the execution of sz. 


start; = starty, YSk © sı A k,l € [0, m] (7) 


Constraint 3 (Equation 7) indicates that service requests s; 
and s; have to be executed concurrently. 


II relit > Dy,0<D, <1 (8) 

1<k<m 
Constraint 4 (Inequality 8) indicates that the overall relia- 
bility of the combined service should be larger than or equal 


to a threshold D,. This constraint is not linear but it can be 
converted to a linear one by using the logarithm function ln: 


D i<ck<m Yk In(relijn) > In(D1). 
5 dur, < Dz (9) 


1<k<m 


Constraint 5 (Inequality 9) indicates that the overall duration 
of task execution should be less than or equal to a threshold 


Də. 
5 Prik < D3 


1<k<m 


(10) 


Constraint 6 (Inequality 10) indicates that the overall price 
to execute the task should be less than or equal to a threshold 
D3. The values of thresholds, Dı, Dz and D3, are associated 
with specific tasks. 

The objective function is then given in the Maz function 
(11) which implies that the overall reliability should be max- 
imised while the overall execution duration time and the total 
price should be minimised. w1, w2 and w3 are the weights of 
reliability, execution duration and price, respectively, where 
0 < w; < 1 and S raz w; = 1. The values of weights, w1, 
wə and w3, are associated with specific consumers. 


Maz(wı 5 Yk ` In(reli;,) — we 5 dur, — w3 X prix) 
k=1 k=1 k=1 
(11) 


G. Stage 5: network evolution 


After Stage 4, when the task execution is completed, 
consumer v; may want to modify relationships with its ac- 
quaintances. v; may remove less efficient acquaintances or 
may add efficient participants as its new acquaintances. Also, 
consumer v; may buy extra services if v; has often used 
them, or v; may sell some of existing services if v; or other 
participants have not used them for a long time. The aim 
of removing less efficient acquaintances is that v; can save 
communication overhead in the future, because v; does not 
need to communicate with these less efficient acquaintances 
any more. Similarly, the aim of adding efficient participants as 
new acquaintances is that in the future v; can avoid brokers 
and directly communicate with these new acquaintances for 
service composition. Thus, communication overhead can be 


saved. The aim of buying/selling services is to save v;’s service 
usage/service management cost so as to improve v;’s overall 
utility. 

The three problems, namely modifying relationships, buying 
services and selling services, can be modeled as a decision 
making problem, i.e., whether or not to do some thing based 
on some observation. The decision making process of v; 
modifying relationship with v; is described as follows. The 
decision making process of other problems is similar. 

Suppose that vj is one of v;’s acquaintances. Every time, 
when v,;’s task execution is completed, v; has two options, 
i.e., whether it should keep v; as an acquaintance or not. Let 
po be the probability of keeping vj as an acquaintance and 
pı be the probability of removing v; from v;’s acquaintance 
list. Also, let Q(ob(t),0) be the expected reward of keeping 
vj as an acquaintance and @(ob(t), 1) be the expected reward 
of removing vj from v;’s acquaintance list, where ob(t) is 
v;'8 observation to vj till time step t. Initially, pọ = 1, pı = 
0, Q(ob(t),0) = 0 and Q(ob(t), 1) = 0. Then, after a task 
execution, t+ 1, Q(ob(t+1), 0) is updated using Equation 12, 
where Ro is immediate reward obtained by v; for keeping vj 
as an acquaintance during this task execution, and y, 0 < y < 
1, is a constant learning rate. 


Q(ob(t+1), 0) = Q(ob(t), 0) +po-7-(Ro—Q(ob(t),0)) (12) 


Here, immediate reward, Ro, is determined based on 1) the 
number of services that v; provides v; and 2) the number 
of proper providers that v; has found for v; during this task 
execution. Then, after the calculation of Q(ob(t + 1), 0) using 
Equation 12, po and pı are adjusted using Equation 13, where 
e 0 <e€< 1, is a greedy coefficient. 

= po + €: Q(ob(t + 1), 0) (13) 
pi =1—po 


After the probability adjustment, function Normalise() 
(Algorithm 2) is again used to project po and pı to a valid 
probability distribution: 0 < po,p1 < 1 and pp + pı = 1. Fi- 
nally, v; selects an option based on the probability distribution, 
po and pı. If v; selects 1, namely that v; decides to remove 
vj from its acquaintance list, the above process terminates. 

For the two other problems, buying/selling services, the 
process is similar to the above process. If v; decides to 
buy or sell a service, v; can use the methods proposed in 
Stages 1, 2 and 3, where in Stage 1, v; searches potential 
sellers/buyers; then in Stage 2, v; uses the trust model to select 
trustworthy sellers/buyers; finally, in Stage 3, v; negotiates 
with these candidates and buys/sells services from/to the 
candidates which can maximise v,’s utility. 

As our approach is developed in a general environment, 
the application of it in a specific environment, e.g., sensor 
networks, is not studied in this paper. The application of 
our approach in specific environments needs corresponding 
modification. For example, if the approach is applied in 
sensor networks, as a sensor network is a resource-constrained 
environment, buying/selling services among sensors must be 
realised by using energy-efficient resource sharing techniques 
[42]. 


H. Discussion 


In the proposed approach, it is assumed that for each service 
type, there is only one candidate service. Generally, for each 


service type, a provider may have different services which may 
have different property values, e.g., different response times 
and different prices. In this paper, as the approach consists of 
five stages, for the ease of presentation and understanding, it is 
assumed that for each service type, there is only one candidate 
service and each provider may or may not offer the candidate 
service. However, the prices of the candidate services provided 
by different providers may be different, because the prices of 
the candidate services are negotiable (refer to Section IV-E). 
Moreover, in the proposed approach, it is not assumed that the 
quality of the candidate services offered by different providers 
is identical. Thus, the negotiable prices imply that the quality 
of the candidate service offered by different providers may be 
different. Therefore, the proposed approach can accommodate 
multiple candidate services, although in the current version, 
the multiple candidate services are not taken into account. The 
evaluation of the proposed approach with multiple candidate 
services is one of our future studies. 


V. EXPERIMENT AND ANALYSIS 

In this section, the proposed integrated self-evolving service 
composition approach, named as Self-Evo, is evaluated* in 
comparison with three other related and representative ap- 
proaches which are described as follows. 

1) Mixed Integer Programming (MIP) [1]: MIP is an ef- 
ficient and well-known approach for service composition in 
a centralised manner [23], [1]. Service composition problem 
can be mapped to an MIP problem by setting a set of 
constraints, defining a set of variables and establishing an ob- 
jective function. The MIP problem is then solved by a central 
manager. In this experiment, the central manager is a simulated 
entity which is not a participant of the network. The central 
manager can directly communicate with any participants in 
the network. As the simulated network is a general network, 
specific communication medium is not considered. 

2) Multi-Agent Reinforcement Learning (MARL) [12], [26]: 
MARL is a subdiscipline of multi-agent techniques. In MARL, 
each agent learns by trial-and-error interaction with a dynamic 
environment. Agents can autonomously adjust their behaviour 
based on the feedback from the environment so as to achieve 
better results in the future. Thus, using MARL, systems in- 
herently have good self-adaptability. In existing MARL based 
service composition approaches, users are modeled as agents 
and services are modeled as actions, where in different states, 
users take different actions and obtain different rewards. Based 
on the rewards, users adjust the probability distribution over 
their actions, i.e., services, for service selection in the future. 
However, although agents can make decisions individually 
without central control, agents still need global information. 

3) Service Capability Table based approach (SCT) [10]: 
SCT based approach does not need central control or global 
information. Each agent has a service capability table which 
records a list of acquainted agents and their service capabili- 
ties, which implies that each agent has limited knowledge of 
the environment. Consumer agents request services via broker 
agents which have two SCTs that record information about 
service provider agents and other broker agents. When a broker 


4 Although target tracking in WSNs is used as an example in this paper, 
the proposed approach will not be evaluated in WSNs. This is because our 
approach is developed in a general environment and should not be limited 
in WSNs only. Instead, we evaluate our approach in a general environment 
to ensure that the experimental results are representative in most operating 
environments. 


agent receives a request to carry out a service composition, the 
broker agent contracts services with proper provider agents in 
its SCT if possible, otherwise the broker agent subcontracts to 
other broker agents in its SCT to fulfill the unresolved service 
requirements. 

In this experiment, the performance of the four approaches, 
i.e., our Self-Evo and three above-mentioned representative ap- 
proaches, is evaluated by measuring three quantitative metrics: 
success rate, communication overhead and time consumption. 

1) Success rate is the percentage of successful service 
composition. As success rate is a ratio between the number 
of successful service composition and the total number of 
attempted service composition, it does not have a unit. 

2) Communication overhead is the average number of 
messages used for a successful service composition. The size 
of a message is neglected, as it is the same in all the four 
evaluated approaches in the experiment. Thus, communication 
overhead does not have a unit. 

3) Time consumption ° is the average time consumed for a 
successful service composition. The unit of time consumption 
is millisecond (ms). 


A. Experimental Setup 


The experiment was conducted in both static and dynamic 
environments. The network structure is randomly generated, 
where every two participants are acquaintances of each other 
with a specific probability. The average number of neighbours 
of each agent is set to 8. © In the static environment, the 
number of participants in the environment is fixed, while 
in the dynamic environment, participants can join and leave 
the environment dynamically. In the static environment, the 
experiment was conducted in three scenarios. In the first 
scenario, it is measured that how the performance of the 
four approaches changes with increase of network size. In the 
second scenario, it is measured that how the performance of 
the approaches changes with the variation of task introduction 
probability. Task introduction probability means that at each 
time step, with which probability a task is introduced into the 
environment. In the third scenario, it is measured that how 
the performance of the approaches changes with the variation 
of service allocation probability. Service allocation probability 
means that with which probability a service is allocated to a 
participant. In the dynamic environment, in addition to the 
above three scenarios, there is the fourth scenario, where 
it is measured that how the performance of the approaches 
changes with the variation of the probability of participant 
leaving and joining. This experiment is simulated by using a 
general programming language, Java. Then, ‘agent’ is initially 
programmed as a class and then all agents are the objects 
of this class. Services are simply represented as integers and 
each agent is randomly allocated some of the services. A task 
is programmed as an array. Each element in the array is an 
integer which represents the required service. The number of 
services required by a task is randomly generated. Moreover, 
the experiment is run on an Intel i5-2450m 2.5GHz Windows 


5In service computing, runtime has specific meaning: the running or 
execution of service-based systems. Thus, in the experiment, in order to avoid 
confusion, we use “time consumption” instead of “runtime”. 

6Different average numbers of neighbours, e.g., 4, 6, 8, 10 and 12, 
have been attempted in the experiment. Those different average numbers of 
neighbours, however, do not affect the trend of the performance, i.e., relative 
performance, of the four approaches. We thus use the middle number, 8, in 
the experiment. 
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10 PC with 10GB RAM. The experimental results are obtained 
by averaging 100 runs. Here, we use Java instead of a specific 
agent platform to implement the experiment, because 1) this 
paper mainly studies service composition instead of agent 
technologies which are used as development methods; 2) the 
aim of the experiment is to evaluate the effectiveness of the 
proposed approach; and 3) using specific agent platforms is 
certainly feasible for the implementation of agents but will not 
affect the effectiveness of the proposed approach. Therefore, 
specific agent platforms are not necessary in the experiment. 

The values and meanings of the parameters used in the 
experiment are listed in Table I. These values were chosen 
experimentally to provide the best results. 


TABLE I 
PARAMETERS SETTING 














| Parameters Values Explanations 

| aG ¥ 0.2, 0.3, 0.2 Learning rates 

| P 0.7 Threshold for candidate selection 
l € 0.1 Greedy coefficient 














B. Experimental Results: Static Environment 


1) The first scenario: Fig. 3 demonstrates the performance 
of the four approaches in different network scales. In this 
scenario, the task introduction probability is fixed at 0.3 and 
the service allocation probability is fixed at 0.5. 

In Fig. 3(a), it can be seen that with the increase of network 
scale, the success rates achieved by MIP and MARL decrease 
gradually, while the success rates achieved by SCT and our 
Self-Evo keep relatively steady. This is because both MIP 
and MARL need information of all the participants in the 
network and when the number of participants in the network 
increases, the amount of required information also rises. Thus, 
the computation overhead under MIP and MARL becomes 
huge. Due to the limitation of processing capability, some tasks 
cannot be processed on time, so the success rate decreases. For 
SCT and Self-Evo, no global information is required and each 
participant requests services only from its acquaintances, so 


the increase of network scale does not have much impact on 
SCT and Self-Evo. It should be noted that the success rate 
achieved by our Se/f-Evo is higher than that achieved by SCT. 
This is because under Se/f-Evo, network structures evolve over 
time, where each participant keeps only useful acquaintances 
and services after evolution. Therefore, a service composition 
for a task can be completed faster. In other words, more service 
compositions can be completed in a period by using Self-Evo 
than using SCT. 

In Figs. 3(b) and 3(c), it can be seen that with the increase 
of network scale, both the communication overhead and the 
time consumption under the four approaches rise. For MIP 
and MARL, as the two approaches need global information, 
the central manager in MIP and the focal consumer agent’ in 
MARL has to request information from every participant in the 
network. Thus, as the number of participants in the network 
increases, both MIP and MARL need more communication 
and time to obtain the global information of the network. 
For SCT and Self-Evo, with the increase of the number of 
participants, the average number of acquaintances of each 
participant also increases, which then incurs the increase of 
the communication overhead and time consumption under SCT 
and Self-Evo. Again, as the network structure can be optimised 
under Self-Evo, Self-Evo uses less communication and time 
than SCT. 

2) The second scenario: Fig. 4 displays the performance 
of the four approaches with different number of tasks in the 
network. In this scenario, the number of participants in the 
network is fixed at 500 and the service allocation probability 
is fixed at 0.5. 

In Fig. 4(a), with the increase of task introduction proba- 
bility, more and more tasks exist in the network. As each task 
has a deadline, some tasks may not be processed on time. 
Thus, the success rates achieved by all the four approaches 


7Here, a focal consumer agent is a consumer agent which initialises a 
service composition. 
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decrease. It should be noted that when the task introduction 
probability is larger than 0.7, Self-Evo performs better than 
the other three approaches, while MIP performs the worst. 
This is because with the increase of the number of tasks, 
as in MIP all the processing is based on a single central 
manager, tasks will overwhelm the central manager given 
that the processing capability of the central manager is finite. 
In Self-Evo, as participants can adjust their behaviour with 
change of the environment, the pressure of a large number 
of tasks can be alleviated to some extent. For example, if a 
consumer has a number of tasks to do, it will attempt to buy 
proper services from other participants and it will establish 
acquaintance relationship with those participants who have the 
required services. Such adjust can relax the task pressure. 

In Figs. 4(b) and 4(c), it can be seen that with the in- 
crease of the number of tasks, the communication overhead 
under MIP and MARL keeps relatively steady while the time 
consumption under MIP and MARL increases gradually. In 
MIP and MARL, the communication overhead to complete a 
service composition for a task is independent of the number 
of tasks, as only successfully completed tasks are counted in. 
To successfully complete a service composition for a task, the 
communication overhead for MIP and MARL is almost fixed, 
i.e., the central manager or focal consumer agent broadcasts 
a request message in the network, receives response messages 
and then calculates a solution. The time consumption under 
MIP and MARL increases gradually, because with the increase 
of the number of tasks, service compositions for some tasks 
may not be completed at once and have to stay in the waiting 
list for a while. Thus, the average time used to complete a 
service composition rises. 

For SCT and Self-Evo, the communication overhead and 
time consumption rise with the increase of the number of tasks 
in the network. In SCT and Self-Evo, communication happens 
between service consumers, brokers and providers. When 
the number of tasks increases, available services are more 
and more difficult to obtain, which implies that consumers 
have to find and negotiate with more providers to achieve 
a service agreement. Thus, the communication overhead and 
time consumption to complete a service composition rise. 

3) The third scenario: Fig. 5 displays the performance of 
the four approaches with different service allocation probabili- 
ties. In this scenario, the number of participants in the network 
is fixed at 500 and the task introduction probability is fixed at 
0.3. 

In Figs. 5(a), 5(b) and 5(c), with the increase of service 
allocation probability, the success rates achieved by SCT and 
Self-Evo increase gradually, while the communication and 
time used by SCT and Self-Evo decrease gradually. With 


the increase of service allocation probability, each participant 
possesses more services. Thus, each consumer can complete a 
service composition more quickly and easily, as it needs less 
search and negotiation for a service composition. For MIP 
and MARL, with the increase of service allocation probability, 
the success rates and communication overhead keep relatively 
steady, while the time consumption decreases gradually. The 
service composition under MIP and MARL is, to some extent?, 
independent of the number of services that each participant 
has, as in MIP and MARL, the central manager and the 
focal consumer agent can utilise any available services in 
the environment no matter where the services are. Thus, 
the success rates and communication overhead keep almost 
unchanged. The decrease of time consumption is explained 
as follows. When each participant has few services, tasks 
have to wait for a relatively long time to be served. Thus, 
the average time consumption of each service composition 
is relatively long. This situation, however, is alleviated when 
each participant has more services, so the time consumption 
decreases with the increase of service allocation probability. 


C. Experimental Results: Dynamic Environment 


The experiment was also conducted in dynamic environ- 
ments, where participants may leave or join the environment 
dynamically. In the first, second and third scenarios, it is 
set that after execution of every ten time steps, 5% existing 
participants leave the network and 5% new agents join the 
network with a specific probability. Figs. 6, 7 and 8 demon- 
strate a similar trend on performance variation in dynamic 
environments compared to static environments. In general, 
the performance of the four approaches are less efficient in 
dynamic environments than the performance in static environ- 
ments. This is because for example, in dynamic environments, 
consumers may leave the environments before their tasks are 
completed. These tasks may then become unsuccessful tasks. 
Thus, success rates will decrease. Moreover, if providers leave 
the environments, consumers may have to find other providers, 
so the communication overhead and time consumption will 
increase. Moreover, our Self-Evo has to face the cold-start 
problem due to the use of the trust model. The cold-start 
problem means that when new agents join the network, their 
neighbours do not have information about their performance. 
Thus, the trust against these new agents has to be built up 


8When the task introduction probability is high, e.g., 0.7, the service 
composition under MIP and MARL is not independent of the number of 
services that each participant has. This is because when the number of tasks 
in the environment is large and each participant has only a few services, many 
tasks may not be completed on time due to the lack of enough services in 
the environment. 
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anew. Such cold-start problem has a negative impact on the 
performance of our approach, because along with the cold- 
start problem, the trust model in Self-Evo cannot effectively 
help participants select efficient candidates. 


Comparing Fig. 6(a) with Fig. 3(a), with the increase of 
network scale, the success rate of SCT decreases in the 
dynamic environment, while the success rate of SCT keeps 
relatively steady in the static environment. In SCT, service 
composition is carried out via participants’ neighbours. In the 
dynamic environment, a participant’s neighbours may leave 
the environment before a task is completed. Thus, the success 
rate is affected in a negative way. This situation becomes 


worse in a large scale network, as the more participants in 
the network, the more they may leave (recalling above that 
5% existing participants leave the network with a specific 
probability). When a participant loses most of its neighbours, 
it is difficult for the participant to complete any complex 
tasks. Although new agents may join the network, they are 
randomly scattered in the network and do not have to be the 
neighbours of that participant. Similar to SCT, in our Self-Evo, 
service composition is also conducted through participants’ 
neighbours. However, in our Se/f-Evo, when a participant loses 
its neighbours, the participant can establish new neighbour- 
ship with other participants and also the participant can buy 


required services from other participants. Thus, the negative 
impact of the dynamism on Se/f-Evo can be alleviated to some 
extent via self-evolution. As MIP and MARL utilise global 
information, the dynamism does not have much impact on 
them. 

Comparing Fig. 7(b) with Fig. 4(b), in the dynamic environ- 
ment, when the number of tasks increases, the communication 
overhead of MIP and MARL also increases, while in the 
static environment, when the number of tasks increases, the 
communication overhead of MIP and MARL keeps relatively 
steady. As described above, if providers leave the environment, 
consumers may have to find other providers, which incurs extra 
communication and time consumption. This situation will be 
worse when the number of tasks increases. This is because 
when the number of tasks increases, a provider may serve 
more tasks simultaneously. If this provider leaves, more tasks 
will be affected and thus, more communication and time are 
needed. 

In the exclusive scenario for the dynamic environment, Fig. 
9 demonstrates the performance change of the four approaches 
with the variation of the probability of participant leaving 
and joining. For the four approaches, with the increase of 
the probability of participant leaving and joining, the suc- 
cess rates decrease while the communication overhead and 
time consumption increase. This means that the increase of 
dynamism level of the environment has negative impact on 
the performance of the four approaches. Since the leaving 
of participants implies the failure of participants, consumers 
or the central manager have to spend extra communication 
and time to counter such situation, which is certainly harmful 
for the overall performance. The joining of new partici- 
pants can also offset such situation to some extent, but the 
usefulness of the new participants is limited. For MIP and 
MARL, if providers leave the environment, the joining of new 
participants can increase the probability of completing the 
service compositions which are left by those leaving providers. 
However, the joining of new participants cannot avoid the 
communication and time used by the central manager or focal 
consumers to find new providers. For SCT and Self-Evo, the 
usefulness of the new participants is further limited. In SCT 
and Self-Evo, consumers can only ask acquaintances for help. 
However, these new participants are randomly spread in the 
network, so the consumers, which need new providers for 
service composition, may not find these new participants. 


D. Discussion 


In overall terms, according to the experimental results, the 
proposed approach, Self-Evo, can achieve about 96% of the 
success rates of the centralised approach MIP and the global 
information based approach MARL, but uses about 75% less 
communication and 65% less time than them due to the decen- 
tralised features of Self-Evo. This achievement is significant 
considering that a large amount of communication and time 
are saved. Such huge communication and time consumption 
saving is very important in some real world systems, where 
the energy of each participant is limited? and data has to be 


°The proposed approach is developed in a general environment rather 
than for specific systems or networks. Thus, we did not evaluate the energy 
consumption in the experiment, as energy consumption needs to be evaluated 
within specific networks, e.g., WSNs or mobile ad hoc networks. We, however, 
have evaluated the communication overhead which is an indicator of energy 
consumption. As communication usually consumes energy, especially in 
WSNs, low communication overhead implies low energy consumption. 


processed in a timely manner. The proposed approach, Self- 
Evo, may be computation-intensive, because computation is 
carried out in each of the five stages of the approach. Such 
intensive computation may imply a high time consumption in 
some cases. However, in Self-Evo, the computation is carried 
out by each participant concurrently in a distributed manner. 
Thus, the time consumption is not excessive. Moreover, the 
proposed approach does not require global information. Thus, 
the time for information acquisition can be saved. Therefore, 
the time consumption of Self-Evo stays at a relatively low level 
compared to the other approaches. 

Moreover, Sel/f-Evo is better than SCT in various circum- 
stances. This is mainly because Se/f-Evo allows participants to 
evolve over time, where only useful acquaintances and services 
are kept by each participant. Such evolution can significantly 
reduce communication and time used to find proper providers 
for service composition. 


VI. CONCLUSION AND FUTURE WORK 


This paper proposes an agent-based integrated self-evolving 
service composition approach. This approach is decentralised 
and self-evolvable. Also, unlike most existing approaches 
which study only some of the five stages of service com- 
position, the proposed approach consists of five stages of 
service composition: service discovery, candidate selection, 
service negotiation, task execution and self-evolution. Such 
joint optimisation can make the proposed approach more 
efficient than those which optimise only some stages of service 
composition. 

The proposed approach studies multiple stages of ser- 
vice composition. Each stage has attracted research efforts. 
Compared to those research efforts, the methods used in 
the proposed approach for each stage are effective though 
somewhat simple. Thus, we will focus on the improvement 
and enhancement of these methods so as to achieve better 
performance. First, as described in previous sections, there 
are two limitations in our approach: 1) service presentation 
is simplified, where for each service type, there is only one 
candidate service; 2) the approach is relatively computation- 
intensive in some systems, especially where the computation 
resource is scarce. These two limitations need to be solved 
in the future before the proposed approach can be applied 
in specific systems. Then, as stated in Section V-C, the trust 
model used in our approach has the cold-start problem which 
is a common issue of recommender systems in a network 
environment, especially in a dynamic network environment. 
Currently, there have been a number of algorithms developed 
to handle the cold-start problem, which have been summarised 
in [43]. The development of a new algorithm to address 
the cold-start issue, which is suitable for the service com- 
position problem, is one of our future studies. Moreover, in 
the proposed approach, we do not consider specific control 
logic or workflows, because workflows in service composition 
depend on specific application domains [5], [20] while our 
work is carried out in general environments. Thus, when our 
approach is applied to specific systems, the workflows have 
to be considered as shown in Section II. Finally, energy 
consumption was not evaluated in this paper, as it needs to 
be evaluated within specific networks. The detailed evaluation 
of energy consumption, however, has to be done when the 
proposed approach is applied to specific networks in the future. 
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